Pairwise Coupling for Machine Recognition of Hand-Printed Japanese Characters

نویسندگان

  • Volker Roth
  • Koji Tsuda
چکیده

Machine recognition of hand-printed Japanese characters has been an area of great interest for many years. The major problem with this classification task is the huge number of different characters. Applying standard ”state-ofthe-art” techniques, such as the SVM, to multi-class problems of this kind imposes severe problems, both of a conceptual and a technical nature: (i) separating one class from all others may be an unnecessarily hard problem; (ii) solving these subproblems can impose unacceptably high computational costs. In this paper, a new approach to Japanese character recognition is presented that successfully overcomes these shortcomings. It is based on a pairwise coupling procedure for probabilistic two-class kernel classifiers. Experimental results for Hiragana recognition effectively demonstrate that our method attains an excellent level of prediction accuracy while imposing very low computational costs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Recognition of Hand-Printed Characters via Induct-RDR

The goal of character recognition research is to simplify and automate the development of character recognition algorithms. We describe here an approach based on applying preprocessing to data sets of Latin characters and then applying a machine learning approach to the data sets to build a knowledge base able to classify unseen pre-processed characters. The machine learning method, Induct/RDR,...

متن کامل

Recognition of Hand Printed Characters Based on Simple Geometric Features

Problem statement: The use of computers in information processing technology nowadays is one of the main trends of office automation. For more than four decades, information from the outside world is transferred into computers in a traditional way by keying in these raw data with the help of keyboard. Most of these data are in hand printed form and very large; therefore the use of automatic rec...

متن کامل

Survey of Pattern Recognition Approaches in Japanese Character Recognition

Optical Character Recognition (OCR) in Japanese, both handwritten and printed, is difficult to perform, owing to several reasons. Firstly, the Japanese language is comprised of over 3000 characters which can be classified as syllabic characters, or Kana, and ideographic characters, called Kanji. Secondly, Japanese text does not have delimiters like spaces, separating different words. Thirdly, s...

متن کامل

Blob Detection Technique Using Image Processing for Identification of Machine Printed Characters

Optical character recognition systems have been effectively developed for the recognition of printed characters. Optical character recognition is an awesome computer vision technique with various applications ranging from saving real time scripts digitally and deriving context based intelligence using natural language processing from the texts. One such application is the recognition of machine...

متن کامل

Precise Hand-printed Character Recognition Using Elastic Models via Nonlinear Transformation

Distorted character recognition is a difficult but inevitable problem in hand-printed character recognition. In this paper, we propose a character recognition method using elastic models for recognizing cursive characters with intricate structure. The models are fitted to unknown input patterns by applying the EM algorithm to minimize a measure of fittness. To avoid falling into local minima, m...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001